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Abstract 

In pattern mining, the main challenge is the exponential explosion 
of the set of patterns. Typically, to solve this problem, a constraint for 
pattern selection is introduced. One of the first constraints proposed in 
pattern mining is support (frequency) of a pattern in a dataset. Fre¬ 
quency is an anti-monotonic function, i.e., given an infrequent pattern, 
all its superpatterns are not frequent. However, many other constraints 
for pattern selection are not (anti-)monotonic, which makes it difficult 
to generate patterns satisfying these constraints. In this paper we in¬ 
troduce the notion of projection-antimonotonicity and d-Eocpia algorithm 
that allows efficient generation of the best patterns for some nonmonotonic 
constraints. In this paper we consider stability and A-measure, which are 
nonmonotonic constraints, and apply them to interval tuple datasets. In 
the experiments, we compute best interval tuple patterns w.r.t. these 
measures and show the advantage of our approach over postfiltering ap¬ 
proaches. 

KEYWORDS: Pattern mining, nonmonotonic constraints, interval tu¬ 
ple data 


1 Introduction 

Interestingness measures were proposed to overcome the problem of combina¬ 
torial explosion of the number of valid patterns that can be discovered in a 
dataset M- For example, pattern support, i.e., the number of objects cov¬ 
ered by the pattern, is one of the most famous measures of pattern quality. 

*The final publication is available at link.springer.com 
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In particular, support satisfies the property of anti-monotonicity (aka “a pri¬ 
ori principle”), i.e., the larger the pattern is the smaller the support is HI Hi- 
Many other measures can be mentioned such as utility constraint [24] , pattern 
stability m\m, pattern leverage m, margin closeness [13], MCCS m , cosine 
interest [3], pattern robustness m , etc. 

Some of these measures (e.g., support, robustness for generators m , or 
upper bound constraint of MCCS usd are “globally anti-monotonic”, i.e., for 
any two patterns ICY we have A4(X) > M.(Y ), where M is a measure and 
C denotes the (subsumption) order relation on patterns. When a measure is 
anti-monotonic, it is relatively easy to find patterns whose measure is higher 
than a certain threshold (e.g., patterns with a support higher than a threshold). 
In contrast some other measures are called “locally anti-monotonic”, i.e., for any 
pattern X there is an immediate subpattern Y -< X such that M(Y) > M.{X). 
Then the right strategy should be selected for traversing the search space, e.g., 
a pattern Y should be extended only to patterns X such that A4(Y) > A4(X). 
For example, for “locally anti-monotonic” cosine interest j4], the extension of a 
pattern Y consists in adding only attributes with a smaller support than any 
attribute from Y. The most difficult case for selecting valid patterns occurs when 
a measure is not locally anti-monotonic. Then, valid patterns can be retained 
by postfiltering, i.e., finding a (large set of) patterns satisfying an antimonotone 
constraint and filtering them w.r.t. the chosen nonmonotonic measure (i.e., 
neither monotonic nor anti-monotonic) [151 1131117] , or using heuristics such as 
leap search [22] or low probability of finding interesting patterns in the current 
branch [19]. 

Most of the measures are only applicable to one type of patterns, e.g., pat¬ 
tern leverage or cosine interest can be applied only to binary data since their 
definitions involve single attributes. “Pattern independent measures” usually 
relies on support of the pattern and/or on support of other patterns from the 
search space. In particular, support, stability [10], margin-closeness m and 
robustness HU are pattern independent measures. In this paper we work with 
interval tuple data, where only pattern independent measures as well as specific 
measures for interval tuples can be applied. In addition, given a measure, it 
can be difficult to define a good threshold. Thus various approaches for finding 
top-IF patterns were introduced la Bung, with the basic idea to automatically 
adjust the threshold for a measure M. 

In this paper we introduce a new algorithm D-Eocpia, i.e., Sofia, for “Search¬ 
ing for Optimal Formal Intents Algorithm” for a interestingness threshold H, 
for extracting the best patterns of a kind, e.g., itemsets, interval tuples, strings, 
graph patterns, etc. D-Eocpia algorithm is applicable to a class of measures 
called “projection-antimonotonic measures” or more precisely “measures anti¬ 
monotonic w.r.t. a chain of projections”. This class includes globally anti¬ 
monotonic measures such as support, locally anti-monotonic measures such as 
cosine interest and some of the nonmonotonic measures such as stability or ro¬ 
bustness of closed patterns. The main novelty of this paper is 'd-Eocpi.a, a new 
efficient algorithm for finding best patterns of different kinds w.r.t. projection- 
antimonotonic measures which constitutes a rather large class of measures. 
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The remaining of the paper is organized as follows. The formalization of the 
current approach is based on Formal Concept Analysis (FCA) [6] and pattern 
structures [5] which are introduced in Section [2j Then, t)-Eo(pia algorithm is 
detailed in Section[3]first for an arbitrary measure and second for the A-measure. 
Experiments and a discussion are proposed in Section [2 before conclusion. 


2 Data Model 

2.1 FCA and Pattern structures 

Formal Concept Analysis (FCA) is a formalism for knowledge discovery and 
data mining thanks to the design of concept lattices [B]. It is also convenient for 
describing models of itemset mining, and, since [14] . lattices of closed itemsets 
(i.e., concept lattices) and closed descriptions are used for concise representation 
of association rules. For more complex data such as sequences and graphs one 
can use an extension of the basic model, called pattern structures [5]. With 
pattern structures it is possible to define closed descriptions and to give a concise 
representation of association rules for different descriptions with a natural order 
(such as subgraph isomorphism order) IT:, 181. 

A pattern structure is a triple (G,(D,n),5), where G is a set of objects, 
( D , n) is a complete meet-semilattice of descriptions and 5 : G —>• D maps an 
object to a description. 

The intersection n gives the similarity of two descriptions. Standard FCA 
can be presented in terms of a pattern structure. A formal context ( G,M,I ), 
where G is a set of objects, M is a set of attributes and / C G x M an incidence 
relation giving information about attributes related to objects, is represented 
as a pattern structure (G, (p(M), fl), <5), where (p(M), fl) is a semilattice of 
subsets of M with PI being the set-theoretical intersection. If x = {a, b, c} and 
y = { a , c, d}, then x\~\y = x P\y = { a , c}. The mapping S : G —>• p(M) is given 
by 5(g) = {m £ M | ( g,m ) £ 1} and returns the description of a given object 
as a set of attributes. 

The following mappings or diamond operators give a Galois connection be¬ 
tween the powerset of objects and descriptions: 

:= | | <5(jg), for A C G 

g£A 

,f := {g £ G | d E 5(g)}, for d £ D 

Given a subset of objects A, A 0 returns the description which is common 
to all objects in A. Given a description d , d° is the set of all objects whose 
description subsumes d. A partial order C (subsumption) on descriptions from 
D is defined w.r.t. the similarity operation ri: c C rf O c n d = c, and c is 
subsumed by d. 

A pattern concept of a pattern structure (G, ( D , n), 5) is a pair (A, d), where 
A C G, called pattern extent and d £ D, called pattern intent, such that A° = d 
and df = A. A pattern extent is a closed set of objects, and a pattern intent is 
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a closed description, e.g., a closed itemset when descriptions are given as sets 
of items (attributes). As shown in [Tlj . descriptions closed in terms of counting 
inference (which is a standard data mining approach), such as closed graphs [23] , 
are elements of pattern intents. 

A pattern extent corresponds to the maximal set of objects A whose descrip¬ 
tions subsume the description d, where d is the maximal common description 
for objects in A. The set of all pattern concepts is partially ordered w.r.t. inclu¬ 
sion on extents, i.e., (A\,d\) < (A^,^) iff A\ C A 2 (or, equivalently, E g?i), 
making a lattice, called pattern lattice. 

2.2 Interval pattern structure 

A possible instantiation of pattern structures is interval pattern structures intro¬ 
duced to support efficient processing of numerical data without binarization [5]. 
Given k numerical or interval attributes whose values are of the form [a, b ], 
where a, 6 G IR, the language of a pattern space is given by tuples of intervals of 
size k. For simplicity, we denote intervals of the form [a, a] by a. 

Figure [la] exemplifies an interval dataset. It contains 6 objects and 2 at¬ 
tributes. An interval as a value of an attribute corresponds to an uncertainty 
in the value of the attribute. For example, the value of m\ for (72 is known 
exactly, while the value of m 2 is lying in [1,2]. Given this intuition for intervals 
it is natural to define similarity of two intervals as their convex hull, since by 
adding new objects one increases the uncertainty. For example, for g\ the value 
of m 1 is 0 , while for g e it is 1 , thus given the set {gi,ge}, the uncertainty of 
m 1 in this set is [0,1], i.e., the similarity of <?i and g§ w.r.t. m\ is [0,1]. More 
formally, given two intervals [a, b\ and [c, d], the similarity of these two intervals 
is given by [a, b] n [c, d] = [min(a, c), max(i, d)}. Given a tuple of intervals, the 
similarity is computed component-wise. For example, g\ n g^ = ([0,1]; [0, 2]). 
Reciprocally, ([0,1]; [0, 2]) = { 5 - 1 , 52 , • - • ,ge}- 

The resulting concept lattice is shown in Figure [Tb] Concept extents are 
shown by indices of objects, intents are given in angle brackets, the numbers 
on edges and on concepts are related to interestingness of concepts and will be 
described in the next subsection. 


2.3 Stability index of a concept 


For real datasets, the number of patterns can be very large, even computing 
the number of closed patterns is a ^P-complete problem [9] . Different mea¬ 
sures were tested for selecting most interesting patterns, such as stability Uni- 
Stability measures the independence of a concept intent w.r.t. randomness in 
data. 

Given a concept C, concept stability Stab(C) is the relative number of subsets 
of the concept extent (denoted by Ext(C)), whose descriptions, i.e., the result 
of (•)* is equal to the concept intent (denoted by Int(C)). 


Stab(C) := 


Kg € p(Ext(C)) | = Int(C)}| 

|p(Ext(C))| 


( 1 ) 
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(a) An interval context, (b) An interval concept lattice with corresponding sta¬ 
bility indexes. Objects are given by their indices. 


Figure 1: A formal context and the corresponding lattice. 


Here p(P) is the powerset of P. The larger the stability, the more objects 
can be deleted from the context without affecting the intent of the concept, i.e., 
the intent of the most stable concepts is likely to be a characteristic pattern of 
a given phenomenon and not an artifact of a dataset. 

We say that a concept is stable if its stability is higher than a given threshold 
0; a pattern p is stable if there is a concept in the lattice with p as the intent 
and the concept is stable. 

Example 1 Figure |7&] shows a lattice for the context in Figure I lal Concept 
extents are given by their indices, i.e., { 51 , 32 } Is given by 12. The extent of 
the highlighted concept C is Ext(C) = { 32 , 33 , 34 }, thus, its powerset contains 2 3 
elements. Descriptions of 2 subsets o/Ext(C) {{ 34 } and 0} are different from the 
intent of C, Int(C) = {TO 3 }, while all other subsets of Ext (C) have a common 
set of attributes equal to (0; [1,2]). So, Stab(C) = 2 2 7 2 = 0.75. Stability of 
other concepts is shown in brackets. It should be noticed that stability of all 
comparable patterns for Int(C) in the lattice is smaller than the stability of C, 
which highlights the nonmonotonicity of stability. 

Concept stability is closely related to the robustness of a closed pattern HZj. 
Indeed, robustness is the probability of a closed pattern to be found in a subset 
of the dataset. To define this probability, the authors define a weight for ev¬ 
ery subset given as a probability of obtaining this subset by removing objects 
from the dataset, where every object is removed with probability a , e.g., given 
a subset of objects X C G, the probability of the induced subset is given by 
p(D a = (X,(D,r\),5)) = al^l(l — a)l G \ x L Stability in this case is the ro¬ 
bustness of closed pattern if the weights of subsets of the dataset are equal to 

2-l°l. 

The problem of computing concept stability is #P-complete [TO]. A fast 
computable stability estimate was proposed in [3] , where it was shown that this 
estimate ranks concepts almost in the same way as stability does. In particular, 
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Stab(C) <1 — 2 A ( c ), where A(C) = min|Ext(C) \Ext(X>)|, i.e., the minimal 

difference in supports between concept C and all its nearest subconcepts. For a 
threshold 9, patterns p with A (p) > 8 are called A-stable patterns. 

Example 2 Consider the example in Figure [7J Every edge in the figure is 
labeled with the difference in support between the concepts this edge connects. 
Thus, A of a pattern is the minimum label of the edges going down from the 
concept. The value A(({g- 2 , g 3 , 34 }; (0; [1,2]))) is equal to 2. Another example is 
A((G; ([0,1]; [0, 2]))) = 2. For this example we can also see that A-measure is 
not anti-monotonic either. 

A-measure is related to the work of margin-closeness of an itemset [T3]. In 
this work, given a set of patterns, e.g., frequent closed patterns, the authors 
rank them by the minimal distance in their support to the closest superpattern 
divided over the support of the pattern. In our case, the minimal distance is 
exactly the A-measure of the pattern. 

Stability and A-measure are not anti-monotonic but rather projection-antimo¬ 
notonic. Patterns w.r.t. such kind of measures can be mined by a specialized 
algorithm introduced in Section [3] But before we should introduce projections 
of pattern structures in order to properly define projection-antimonotonicity 
and the algorithm. 

2.4 Projections of Pattern Structures 

The approach proposed in this paper is based on projections introduced for 
reducing complexity of computing pattern lattices [5] . 

A projection if : D —> D is an “interior operator”, i.e., it is (1) mono¬ 
tonic (a ’■ C y =$■ ip{x) Q ip{y)), (2) contractive {ip(x) C x) and (3) idempotent 
(ip(ip(x)) = ip{x)). A projected pattern structure ip((G, (D,n),5)) is a pattern 
structure (G, (A/,, n^,), if o 5), where D^ = ip(D) = {d £ D \ 3d* £ D : ip(d*) = 
d} and Vx, y £ D, x n^, y := ip(x n y). 

Example 3 Consider the example in Figure QJ If we remove a column corre¬ 
sponding to an attribute, e.g., the attribute m 2 , from the context in Figure ITal 
we define a projection, given by ip(([a, b\; [c, d]}) = ([a, 6]; [— 00 ,+ 00 ]), meaning 
that no value of m 2 is taken into account. 

Given a projection if we call if(D) = {d £ D \ if{d) = d} the fixed set of 
ip. Note that, if ip(d) ^ d, then there is no other d such that ip{d) = d because 
of idempotency of projections. Hence, any element outside the fixed set of the 
projection ip is pruned from the description space. Given the notion of a fixed 
set we can define a partial order on projections. 

Definition 1 Given a pattern structure P = (G, (_D,n),<5) an d t wo projections 
ipi and ip 2 , we say that ip\ is simpler than ip 2 (ip 2 is more detailed than ipi), 
denoted by ipi < ip 2 , if ip\{D) C ip 2 {D), i.e., ipi prunes more descriptions than 
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Our algorithm is based on this order on projections. The simpler a projection 
if) is, the less patterns we can find in ^>(P), and the less computational efforts 
one should take. Thus, we compute a set of patterns for a simpler projection, 
then we remove unpromising patterns and extend our pattern structure and the 
found patterns to a more detailed projection. This allows us to reduce the size 
of patterns within a simpler projection in order to reduce the computational 
complexity of more detailed projection. 

2.5 Projections of Interval Pattern Structures 

Let us first consider interval pattern structures with only one attribute m. Let us 
denote by W = {u>i, • • • , w\w \} all possible values of the left and right endpoints 
of the intervals corresponding to the attribute in a dataset, so that w\ < W 2 < 
■ ■ ■ < w\w\ ■ By reducing the set W of possible values for the left or the right end 
of the interval we define a projection. For example, if {u>i} is the only possible 
value for the left endpoint of an interval and {icivv|} is the only possible value 
of the right endpoint of an interval, then all interval patterns are projected to 
[uq, Wiyiq]. Let us consider this in more detail. 

Let two sets L,R C W such that W\ £ L and w\ w \ £ R be constraints on 
possible values on the left and right endpoints of an interval, respectively. Then 
a projection is defined as follows: 

tpm[L,R] ([a, b]) = [max{7 € L\l < a}, min{r G R\r > &}]. (2) 

Requiring that w± G L and w\w\ £ R we ensure that the sets used for 
minimal and maximal functions are not empty. It is not hard to see that © 
is a projection. The projections given by © are ordered w.r.t. simplicity 
(Definition©. Indeed, given L\ C L and i?i C R , we have ip m [L i.fli] < 4 > m[L,R], 
because of inclusion of fixed sets. Let us notice that a projection W m [w,w] does 
not modify the lattice of concepts for the current dataset, since any interval for 
the value set W is possible. We also notice that a projection is defined 

for one interval, while we can combine the projections for different attributes in 
a tuple to a single projection for the whole tuple ipmi[L 1 ,R 1 }m 2 [L 2 ,R 2 \...- 

Example 4 Consider example in Figure[l\ Let us consider a projection 

V’mi[{0,l},{l}]m 2 [{0,2},{0,2}]- 

The fixed set of this projection consists of {[0,1], 1} x {0, 2, [0, 2]}, i.e., 6 inter¬ 
vals. Let us find the projection of ( g- 2 )° = (0; [1,2]} in a component-wise way: 
V ; m 1 [{o I i} ) {i}](0) = [0,1], since 0 is allowed on the left endpoint of an interval but 
not allowed to be on the right endpoint of an interval; V'm 2 [{o, 2 },{o, 2 }]([1,2]) = 
[0, 2] since 1 is not allowed on the left endpoint of an interval. Thus, 

^mi[{O,l},{l}]m 2 [{O,2},{O,2}]((0; [1, 2]}) = ([0,1]; [0, 2]). 

The lattice corresponding to this projection is shown in Figure © 
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Figure 2: Projected lattice from example in Figure |T| by projection 
^mi[{ 0 ,i},{i}]m 2 [{ 0 , 2 },{ 0 , 2 }]- See Example m 

3 'd-Xocpia Algorithm 

3.1 Anti-monotonicity w.r.t. a Projection 

Our algorithm is based on the projection-antimonotonicity, a new idea intro¬ 
duced in this paper. Many interestingness measures for patterns, e.g., stability, 
are not (anti-)monotonic w.r.t. subsumption order on patterns. A measure At 
is called anti-monotonic, if for two patterns q C p, M.{q) > A 4{p)- For instance, 
support is a anti-monotonic measure w.r.t. pattern order and it allows for effi¬ 
cient generation of patterns with support larger than a threshold pEnum]. The 
projection-antimonotonicity is a generalization of standard anti-monotonicity 
and allows for efficient work with a larger set of interestingness measures. 

Definition 2 Given a pattern structure P and a projection ip, a measure At is 
called anti-monotonic w.r.t. the projection ip, if 

(Vp G G P ,ip(q) =p) M^,(p) > M(q ), (3) 

where Af^(p) is the measure A i of a pattern p computed in ip{P). 

Here, for any pattern p of a projected pattern structure we check that a 
preimage q of p for ip has a measure smaller than the measure of p. It should 
be noticed that a measure At for a pattern p can yield different values if A i is 
computed in P or in ^(P). Thus we use the notation for the measure Af 
computed in ip(P). The property of a measure given in Definition [2] is called 
projection-antimonotonicity. 

It should be noticed that classical anti-monotonic measures are projection- 
antimonotonic for any projection. Indeed, because of contractivity of ip (ip(p) E 
p), for any anti-monotonic measure one has A i(ip(p)) > Af(p). This definition 
covers also the cases where a measure Af is only locally anti-monotonic, i.e., 
given a pattern p there is an immediate subpattern q -< p such that A j(q) > 
M(p), see e.g., the cosine interest of an itemset, which is only locally anti¬ 
monotonic [5- Moreover, this definition covers also some measures that are not 
locally anti-monotonic. As we mentioned in Examples |T| and [2] stability and A- 
measure are not locally anti-monotonic. However, it can be shown that they are 
anti-monotonic w.r.t. any projection [2]. Moreover, following the same strategy 
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one can prove that robustness of closed patterns from m is also anti-monotonic 
w.r.t. any projection. In particular, the robustness of closed patterns defines a 
anti-monotonic constraint w.r.t. any projection. 

Thus, given a measure At anti-monotonic w.r.t. a projection if, if p is a 
pattern such that M^(p) < 9, then M(q) <9 for any preimage q of p for if. 
Hence, if, given a pattern p of if(P), one can find all patterns q of P such that 
if(q) = Pi if i s possible to first find all patterns of ip{P) and then to filter them 
w.r.t. and a threshold, and finally to compute the preimages of filtered 

patterns. It allows one to cut earlier unpromising branches of the search space 
or adjust a threshold for finding only a limited number of best patterns. 

3.2 Anti-monotonicity w.r.t. a Chain of Projections 

However, given just one projection, it can be hard to efficiently discover the 
patterns, because the projection is either hard to compute or the number of 
unpromising patterns that can be pruned is not high. Hence we introduce a 
chain of projections ifo < ifi <■■■< ifk = ^, where a pattern lattice for ^o(P) 
can be easily computed and 1 is the identity projection, i.e., (Vx)l(a;) = x. 
For example, to find frequent itemsets, we typically search for small frequent 
itemsets and then extend them to larger ones. This corresponds to extension to 
a more detailed projection. 

Definition 3 Given a pattern structure P and a chain of projections ifo < 
ifi < ■ ■ ■ < ifk = 1, a measure A4 is called anti-monotonic w.r.t. the chain of 
projections if M. is anti-monotonic w.r.t. all ifi for 0 <i<k. 

Example 5 Let us construct a chain of projections satisfying for the ex¬ 
ample in Figure{l J The value set for the first attribute is W\ = {0,1} and the 
value set for the second is W 2 = {0,1,2}. Let us start the chain from a pro¬ 
jection if 0 = '0mi[{o},{i}]m 2 [{o},{2}]• This projection allows only for one pattern 
([0,1]; [0,2]), i.e., the concept lattice is easily found. Then we increase the com¬ 
plexity of a projection by allowing more patterns. For example, we can enrich 
the first component of a tuple without affecting the second one, i.e., a projection 
if 1 = if mi [{o,i}.{o,i}]m 2 [{o},{ 2 }] ■ This projection allows for 3 patterns, i.e., any 
possible interval of the first component and only one interval [ 0 , 2 ] for the second 
component. Let us notice that it is not hard to find preimages for ifo in ifi(D). 
Indeed, for any pattern p from if a (D) one should just modify either the left side 
of the first interval of p by one value, or the right side of the first interval of p. 

Then we can introduce a projection that slightly enrich the second com¬ 
ponent of a tuple, e.g., if 2 = $ mi [{o,i},{o,i}]m a [{o,i},{i, 2 }] and finally we have 
if 3 = V’mi[Wi,Wi]m 2 [w 2 ,w 2 ]• Finding preimages in this chain is not a hard prob¬ 
lem, since on every set we can only slightly change left and/or right side of the 
second interval in a tuple. Thus, starting from a simple projection and making 
transitions from one projection to another, we can cut unpromising branches 
and efficiently find the set of interesting patterns. 
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3.3 Algorithms 


i 


2 

3 

4 

5 

6 


Data: A pattern structure P, a chain of projections 

= {”00 5 0 i, * * * , 0 /c}, a measure At anti-monotonic for the chain 
and a threshold 0 for At. 

Function ExtendProjection(0 Vi-\) 

Data: i is the projection number to which we should extend 

(0 < * < k), 8 is a threshold value for M , and Vi -1 is the set 
of patterns for the projection ipi-\. 

Result: The set V, of all patterns with the value of measure At 
higher than the threshold 9 for ipi. 

Vi<— 0 ; 

/* Put all preimages in 0i(P) for any pattern p */ 

foreach p E 7^_i do 
| 7^ ^ — Vi U Preimages ( i,p ) 

/* Filter patterns in Vi to have a value of Af higher than 

e */ 


7 

8 
9 

10 


11 

12 

13 

14 

15 


foreach p E Vi do 

if M^^p) < 6 then 

| Vi<— V t \{p} 

Function Algorithm_0-Eocpi,a 

Result: The set V of all patterns with a value of At higher than the 
threshold 9 for P. 

/* Find all patterns in i/jq (P) with a value of At higher 
than 9 */ 

V < — FindPatterns (0, ^o)! 

/* Run through out the chain U/ and find the patterns for 

M p ) */ 

foreach 0 < i < k do 
I V< —ExtendProjectionCij^jT 5 ); 


Algorithm 1: The fl-Eocpia algorithm for finding patterns in P with a 
value of a measure At higher than a threshold 9. 

Given a measure anti-monotonic w.r.t. a chain of projections, if we are able 
to find all preimages of any element in the fixed set of ipi that belong to a 
fixed set of ipi+i, then we can find all patterns of P with a value of At higher 
than a given threshold 9. We call this algorithm fl-Eocpia and its pseudocode 
is given in Algorithm [TJ In lines 11-12 we find all patterns for ^>o(P) satisfying 
the constraint that a value of At is higher than a threshold. Then in lines 13- 
15 we iteratively extend projections from simpler to more detailed ones. The 
extension is done by constructing the set Vi of preimages of the set Vi-\ (lines 
2-5) and then by removing the patterns that do not satisfy the constraint from 
Vi (lines 6-9). 

The algorithm is sound and complete, since first, a pattern p is included 
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into the set of preimages of p (if(jp) = p) and second, if we remove a pattern 
p from the set "P, then the value M.[p) <6 and, hence, the measure value of 
any preimage of p is less than 6 by the projection-antimonotonicity of A4. The 
worst case time complexity of d-Socpia algorithm is 

T(f>-£ocpioc) = 1 (FindPatterns(il)o))+ 

+ k • max \Pi\ ■ (r(Preimages) + (4) 

0 <i<k 

where k is the number of projections in the chain, TT(A’) is time for computing 
operation X. Since projection ifo can be chosen to be very simple, in a typical 
case the complexity of FindPatterns(6,ipo) can be low or even constant. The 
complexities of Preimages and M depend on the measure, the chain of projec¬ 
tions, and the kind of patterns. In many cases max \Vj\ can be exponential in 

the size of the input, because the number of patterns can be exponential. It can 
be a difficult task to define the threshold 6 such that the maximal cardinality of 
Vi is not larger than a given number. This can be solved by an automatically 
adjustment of the threshold 9, which is not discussed here. 

3.4 'd-Socpioc Algorithm for Interval Tuple Data 

In this subsection we consider a pattern structure IK = (G, (£>/, n), 8), where Dj 

is a semilattice of interval tuple descriptions. We say that every component of 

a tuple p corresponds to an attribute m E M, where M is the set of interval 

attributes. Thus, the size of any tuple in Dj is \M\, and for any attribute 

m G M we can denote the corresponding interval by m(p). We also denote the 

value set of m by W m . Since the set W m is totally ordered we also denote by 

Wm and Wm ^ the sets containing the first j (smallest) elements and the last 

j (largest) elements from W m , respectively. 

A projection chain for interval tuple data is formed in the same way as 

discussed in Example [5] We start from the projection containing only one 

pattern corresponding to the largest interval in each component, i.e., for an 

attribute m the projection is of the form Then to pass to 

a next projection, we select the attribute to, and for this attribute we extend 

the projection from ip m [Wm\Wm^] to ip m [Wm +1 \w^ 1 ' ) ]. Thus, there are 

k = maxlWml ■ \M\ projections. 
raEM 

Finding preimages in this case is not hard, since to make a projection more 
detailed one should just extend the corresponding interval in left and/or on right 
end of the interval, i.e., there are only 4 possible preimages for a pattern when 
passing from one projection to another in this chain. Thus, we have proved the 
following 

Proposition 1 The worst case complexity for d-Eofia algorithm for interval 
tuple data is 

T{G-Eo<pia interllals ) = max|W m | ■ \M\ ■ max \Pi\ • T (M). (5) 

m£M 0<Z<K 
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Table 1: Patterns found for every projection in a chain for the example in 
FigureU) Patterns are grey if they are removed for the corresponding projetion 
and they are labeled with ” if they have not yet been found. 


# 

Pattern Ext. 

V’o 

A-measure 
Ipl $2 

1p3 

1 

{gi, 92, 33,34,35,36} 

6 

2 

2 

2 

2 

{ 31 , 32 , 53 , 34 } 

- 

4 

1 

1 

3 

(35,36} 

- 

2 

2 

2 

4 

{3i} 

- 

- 

1 

1 

5 

{32,33,34} 

- 

- 

3 

2 

6 

{34} 

- 

- 

- I 

1 


3.5 'd-Socptoc Algorithm for Closed Patterns 

Closed frequent itemsets are widely used as a condensed representation of all 
frequent itemsets since [14] . Here we show how we can adapt the algorithm for 
closed patterns. A closed pattern in ^>j_i(P) is not necessarily closed in ipi(P). 
However, the extents of ip( P) are extents of P [5]. Thus, we associate the closed 
patterns with extents and then work with extents instead of patterns, i.e., a 
pattern structure P = (G, (D , n), <5) is transformed into Pc = (G, {Dc, IHc), Sc), 
where Dc = 2 G . Moreover, for all x,y £ Dc we have xHcy = (x°n y 0 ) 0 , where 
diamond operator is computed in P and 5c{g € G) = {g}. Hence, every pattern 
p in Dc corresponds to a closed pattern p° in D. A projection ip of P induces 
a projection r ipc of Pc, given by ipc{X C G) = ^(X 0 ) 0 with (-)° for P. 

3.6 A-measure and 'd-Xocpioc Algorithm 

In this subsection we show that A-measure is anti-monotonic for any projection; 
it is a stronger condition than the one required by Definition [3] A-measure 
works for closed patterns, and, hence, we identify every description by its extent 
(Subsection 13.511 . 

Proposition 2 A is anti-monotonic for any projection ip. 

Proof. By properties of a projection, an extent of ip{ P) is an extent of P [5|. Let 
us consider an extent E and an extent of its descendant in i/f(P). Let us suppose 
that E p is a preimage of E for the projection ip. Since E c and E p are extents in 
P, the set E cp = E C D E p is an extent in P (the intersection of two closed sets 
is a closed set). Since E p is a preimage of E , then E p E c (otherwise, E p is a 
preimage of E c and not of E). Then, E cp ^ E p and E cp < E p . Hence, A (E p ) < 
\E p \E cp \ < \E\E C \. So, given a preimage E p of E, i\/E c C E)A(E P ) < |£7\£7 C j, 
i.e., A (E p ) < A (E). Thus, we can use A-measure in combination with fl-Socpia. 
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3.7 Example of A-Stable Patterns in Interval Tuple Data 

Let us consider the example in Figure [I] and show how we can find all A-stable 
patterns with a threshold 9 = 2. The chain of projections for this example is 
given in Example [5j it contains 4 projections: 


^ 0 = V’mRl 0},{l}]m 2 [{0},{2}] 

t/>2 = ti[{0,l},{0,l}]m 2 [{0,l(,{l,2}] 


V’l = V , mi[{0,l},{0,l}]m 2 [{0},{2}] 

^3 = ^mi[{0,l},{0,l}]m2[{0,l,2},{0,l,2}] 


Since we are looking for closed patterns, every pattern can be identified by 
its extent. In Table Q] all patterns are given by their extents, i.e., by elements 
of Dc■ For every pattern A-measure is shown for every ipi. A cell is shown in 
grey if the pattern is no more considered (the value of A less than 2). A cell has 
a dash if a pattern in the row has not been generated for this projection. 

For the example in Figurc[l]the global process is as follows. At the beginning 
tpo(Dj) contains only one element corresponding to pattern extent 123456 (a 
short cut for { 31 , 52 , 33 , 54 , 3 . 5 , 56 }) with a description ([0,1]; [0, 2]). Then, in 
rpi(G, (Dj 7 n), S) possible preimages of 123456 are patterns with descriptions 
(0; [0, 2]) and (1; [0, 2]) given by pattern extents 1234 and 56, respectively. Then 
we continue with these three patterns which are all A-stable for the moment. 
The pattern extents 123456 and 56 have no preimages for the transition ip 1 —>• 
ip 2 , while the pattern extent 1234 has two preimages with descriptions (0; [0,1]) 
and (0; [1, 2]) for this projection, which correspond to pattern extents 1 and 
234. The first one is not A-stable and thus is no more considered. Moreover, 
the pattern extent 1234 is not A-stable (because of 234) and should also be 
removed. Finally, in transition ip 2 —> V*3 on ly extent-pattern 234 has a preimage, 
a pattern extent 4, which is not A-stable. In such a way, we have started from 
a very simple projection ipo and achieved the projection ipz that gives us the 
A-stable patterns of the target pattern structure. 

4 Experiments and Discussion 

In this section we compare our approach to approaches based on postfiltering. 
Indeed, there is no approach that can directly mine stable-like pattern, e.g., 
stable, A-stable or robust patterns. The known approaches use postfiltering to 
mine such kind of patterns mmmm- Recently it was also shown that it 
is more efficient to mine interval tuple data without binarization [S|. In their 
paper the authors introduce algorithm MinlntChange for working directly with 
interval tuple data. Thus we compare D-Eocpia and MinlntChange for finding 
A-stable patterns. We find A-stable concepts with D-Eocpioc and then adjust 
frequency threshold 9 such that all A-stable patterns are among the frequent 
ones. 

The experiments are carried out on an “Intel(R) Core(TM) i7-2600 CPU @ 
3.40GHz” computer with 8Gb of memory under Ubuntu 14.04 operating system. 
The algorithms are not parallelized and are coded in C++. 
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4.1 Dataset Simplification 

For interval tuple data stable patterns can be very deep in the search space, 
such that neither of the algorithms can find them quickly. Thus, we join some 
similar values for every attribute in an interval in the following way. Given a 
threshold 0 < /3, two consequent numbers Wi and 1 / 7+1 from a value set W 
are joined in the same interval if Wi+\ — Wi < /3. In order to properly set the 
threshold /3, we use another threshold 0 < 7 < 1, which is much easier to set. 

If we assume that the values of the attribute m are distributed around several 
states with centers w 1 ,--- ,w l , then it is natural to think that the difference 
between the closest centers abseil ; 1 — w l±1 ) are much larger than the difference 
between the closest values. Ordering all values in the increasing order and 
finding the maximal difference <5 max can give us an idea of typical distance 
between the states in the data. Thus, 7 is defined as a proportion of this 
distance that should be considered as a distance between states, i.e., we put 
/3 = 7 • dmax- If the distance between closest values in W are always the same, 
then even 7 = 0.99 does not join values in intervals. However, if there are two 
states and the values are distributed very closely to one of these two states, 
then even 7 = 0.01 can join values into one of two intervals corresponding to 
the states. 

4.2 Datasets 

We take several datasets from the Bilkcnt University database 0- The datasets 
are summarized in Table [2] The names of datasets are given by standard ab¬ 
breviations used in the database of Bilkcnt University. For every dataset we 
provide the number of objects and attributes and the threshold 7 for which 
the experiments are carried out. For example, database EM has 61 objects, 9 
numeric attributes, and the threshold 7 is set to 0.3. Categorical attributes and 
rows with missing values, if any, are removed from the datasets. 

4.3 Experiments 

In Table [2] we show the computation time for finding the best A-stable pattern 
(or patterns if they have the same value for A-measure) for D-Eocpia and for 
MinlntChange. The last algorithm is abbreviated as MIC. Since MinlntChange 
algorithm sometimes produces too many patterns, i.e., we do not have enough 
memory in our computer to check all of them, we interrupt the procedure and 
show the corresponding time in grey. We also show the number of the best 
patterns and the corresponding threshold A. The support threshold 9 for finding 
the best A-stable patterns is also shown. For example, dataset CN contains 5362 
best A-stable patterns, all having a A of 2. To find all these patterns with a 
postfiltering, we should mine frequent patterns with a support threshold lower 
than 30 or ^ = 30%. D-Eocpia computes all these patterns in 2.4 seconds, while 

Http://funapp.cs.bilkent.edu.tr/DataSets/ 
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Table 2: Runtime in seconds of Eocpia and MinlntChange for different datasets. 


DS 

# Objs 

# Attrs 

7 

A 

# Ptrns 

e 

^•Socpia 

tmc 

EM 

61 

9 

0.3 

3 

3 

21 

< 0.1 

57 

BK 

96 

4 

0.3 

4 

50 

46 

< 0.1 

11 

CN 

105 

20 

0.8 

2 

5362 

30 

2.4 

28 

cu 

108 

5 

0.3 

5 

4 

27 

< 0.1 

1.5 

FF 

125 

3 

0.3 

6 

3 

48 

< 0.1 

1 

AP 

135 

4 

0.01 

5 

1 

19 

< 0.1 

34 

EL 

211 

12 

0.3 

6 

33 

83 

< 0.1 

34 

BA 

337 

16 

0.5 

4 

736 

91 

1.5 

32 

AU 

398 

7 

0.3 

7 

17 

234 

0.7 

73 

HO 

506 

13 

0.8 

10 

1 

340 

0.7 

57 

QU 

2178 

25 

0.3 

40 

1 

659 

1.3 

28 

AB 

4177 

8 

0.3 

46 

3 

1400 

11 

86 

CA 

8192 

21 

0.3 

85 

6 

2568 

112 

24 

PT 

9065 

48 

0.3 

2 

1 

2 

45 
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MinlntChange requires at least 28 seconds and the procedure was interrupted 
without continuation. 

As we can see, ff-Eocpia is significantly faster than MinlntChange in all 
datasets. In the two datasets CA and PT, MinlntChange was stopped before 
computing all patterns and the runtime did not exceed the runtime of D-Socpia. 
However, in both cases, MinlntChange achieved less than 10% of the required 
operations. 


5 Conclusion 

In this paper we have introduced a new class of interestingness measures that 
are anti-monotonic w.r.t. a chain of projections. We have designed a new 
algorithm, called D-Eocpia, which is able to efficiently find the best patterns 
w.r.t. such interestingness measures for interval tuple data. The experiments 
reported in the paper are the witness of the efficiency of the ff-Ecxpia algorithms 
compared to indirect approaches based on postfiltering. Many future research 
directions are possible. Different measures should be studied in combination 
with tf-Eocpia. One of them is robustness, which is very close to stability and 
can be applied to nonbinary data. Moreover, the choice of a projection chain is 
not a simple one and can affect the algorithm efficiency. Thus, a deep study of 
suitable projection chains should be carried out. 

Acknowledgments: this research was supported by the Basic Research Program at 
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